CDS

Accession Number TCMCG052C13949
gbkey CDS
Protein Id CAB4277989.1
Location complement(join(16331400..16331459,16331553..16331719,16331998..16332103,16332681..16332718,16332836..16332923,16333394..16333447,16333542..16333650,16333826..16333908,16334092..16334154,16334247..16334309,16334796..16334894,16335082..16335120,16335210..16335265,16335750..16335864,16335989..16336158,16336724..16336827,16337133..16337227,16337614..16337696,16337812..16337959))
Organism Prunus armeniaca
locus_tag CURHAP_LOCUS28159
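
The Location field above uses the INSDC feature-location syntax: `complement(join(...))` lists the exon intervals of a CDS on the reverse strand. As a consistency check, the interval lengths can be summed and compared with the protein length reported further down. The following is a minimal sketch in Python; the helper names `parse_intervals` and `spliced_length` are hypothetical and not part of any library.

```python
import re

# Location string copied verbatim from the record.
LOCATION = (
    "complement(join(16331400..16331459,16331553..16331719,"
    "16331998..16332103,16332681..16332718,16332836..16332923,"
    "16333394..16333447,16333542..16333650,16333826..16333908,"
    "16334092..16334154,16334247..16334309,16334796..16334894,"
    "16335082..16335120,16335210..16335265,16335750..16335864,"
    "16335989..16336158,16336724..16336827,16337133..16337227,"
    "16337614..16337696,16337812..16337959))"
)

def parse_intervals(location):
    """Return (start, end) pairs (1-based, inclusive) in the order listed."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def spliced_length(intervals):
    """Total number of nucleotides covered by the joined intervals."""
    return sum(end - start + 1 for start, end in intervals)

intervals = parse_intervals(LOCATION)
total_nt = spliced_length(intervals)

# One codon is the stop codon, so the protein has total_nt // 3 - 1 residues.
print(len(intervals), total_nt, total_nt // 3 - 1)  # → 19 1740 579
```

The 19 intervals sum to 1740 nt, that is 580 codons including the stop codon, which agrees with the 579 aa protein length in the record.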

Protein

Length 579 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJEB37669, BioSample:SAMEA6812185
db_source embl accession CAEKDK010000004.1
Definition unnamed protein product [Prunus armeniaca]
Locus_tag CURHAP_LOCUS28159

EGGNOG-MAPPER Annotation

COG_category E
Description Belongs to the HisA/HisF family
KEGG_TC -
KEGG_Module M00026
KEGG_Reaction R04558
KEGG_rclass RC00010, RC01190, RC01943
BRITE ko00000, ko00001, ko00002, ko01000
KEGG_ko ko:K01663
EC -
KEGG_Pathway ko00340, ko01100, ko01110, ko01230, map00340, map01100, map01110, map01230
GOs -

Sequence

CDS:  
ATGGAGGCGCCGCCATTTGCTTCTTCTACCAAAACGCTGTCGTTTCGATCATCGTCCCCCACGTCTCTTCTTTTTCTTCGCAAAAATCGCCTCAGTTTCAGACCTTCCAGAAACTTCTGCGTCCGTGCCTCCTCTGCCGGTGACTCCGTGGTGACTCTGCTTGACTACGGTGCCGGCAATGTTCGTAGTGTGAGGAATGCCATTCGCCACCTCGGCTTCGACGTTAAAGATGTTCAAACTCCAGAAGACATTCTCAACGCCAACCGCCTAGTTTTTCCTGGAGTGGGGGCTTTTGCTGCGGCCATGGATGTGCTGAATAAGAATGGGATGGCTGAAGCACTCTGTTCATATATTGAGAAGGACCGACCATTTCTAGGCATTTGTCTGGGTCTTCAGCTCCTTTTTGAATCCAGTGAAGAGAAAGGACCAGTGAAAGGTCTTGGCTTGATACCGGGAGTGGTTGGGCGTTTTGATTCATCAAATGGTTTCAGAGTTCCACACATTGGATGGAATGCTTTGCAGATTAGAAAGGACTCATTAATTTTGGATGATGTTGGAAGCAATCATGTCTATTTTGTTCATTCTTACAGAGCCATGCCCTCAGATGAAAACAACGAATGGGTTTCATCTACTTGCAACTATGGTGACAATTTTATTGCGTCCGTTAGAAGGGGAAATGTGCATGCAGTTCAATTTCACCCGGAAAAGAGTGGAGATGTTGGTCTTTCAATATTGAGAAGATTTTTGTATCCAAAGGCACAGTTGACAAAGAAGCCCACTGAAAGGAAGGCTTTGAAACTTGCAAAGAGGGTGATTGCTTGTCTTGATGTGAGGACAAATGACAAAGGAGATCTTGTTGTAACCAAAGGCGACCAATACGATGTAAGAGAGCATACAAAAGAGAATGAGGTGAGAGAACTTGGCAAGCCTGTGGAGCTGGCTCGACAGTATTACAAAGATGGGGCAGATGAGGTCAGTTTTTTAAATATTACCGGTTTCCGCGACTTCCCTTTGGGCGACTTGCCCATGTTACAGGTACTGAGATACACATCAGAAAATGTTTTTGTACCATTAACAGTTGGAGGTGGCATTAGAGATTTTACAGATGCTAATGGCAGGAAGTATTCTAGTTTGGAAGTTGCTTCAGAATATTTCAGATGTGGGGCTGATAAGATTTCCATTGGGAGTGATGCAGTTTATGCTGCAGAAGAATATTTAAGAACTGGAGTAAAAAGTGGAAATAGTAGCTTAGAGCAGATATCTAGAGTTTATGGAAATCAGGCTGTGGTTGTAAGCATTGATCCTCGCAGAGTGTACCTTAAAAATCCAGAGGATGTAGGGTTCAAGACTATTAGGGTAACAAACCCAGGTCCAAACGGAGAGGAATTTGCGTGGTATCAGTGTACAGTTAGCGGTGGCCGAGAAGGCCGACCAATTGGAGCTTATGAGCTTGCAAAAGCAGTTGAGGAGCTAGGAGCTGGAGAAATACTGCTAAACTGCATTGATTGTGATGGTCAAAAGAAGGGATTTGATATAGATTTAATAAAGCTGATCTCAGATGCTGTGGGCATTCCCGTGATTGCTAGTAGTGGTGCTGGTTCTGCTGAACACTTCTCAGAGGTGTTCAGGAAAACAAATGCATCTGCTGCCCTTGCTGCAGGCATTTTCCATCGGAAGGAGGTGCCTATTCAATCTGTAAAGGAGCATTTGTTAAATGAAGGCATAGAAGTCAGAATCTGA
Protein:  
MEAPPFASSTKTLSFRSSSPTSLLFLRKNRLSFRPSRNFCVRASSAGDSVVTLLDYGAGNVRSVRNAIRHLGFDVKDVQTPEDILNANRLVFPGVGAFAAAMDVLNKNGMAEALCSYIEKDRPFLGICLGLQLLFESSEEKGPVKGLGLIPGVVGRFDSSNGFRVPHIGWNALQIRKDSLILDDVGSNHVYFVHSYRAMPSDENNEWVSSTCNYGDNFIASVRRGNVHAVQFHPEKSGDVGLSILRRFLYPKAQLTKKPTERKALKLAKRVIACLDVRTNDKGDLVVTKGDQYDVREHTKENEVRELGKPVELARQYYKDGADEVSFLNITGFRDFPLGDLPMLQVLRYTSENVFVPLTVGGGIRDFTDANGRKYSSLEVASEYFRCGADKISIGSDAVYAAEEYLRTGVKSGNSSLEQISRVYGNQAVVVSIDPRRVYLKNPEDVGFKTIRVTNPGPNGEEFAWYQCTVSGGREGRPIGAYELAKAVEELGAGEILLNCIDCDGQKKGFDIDLIKLISDAVGIPVIASSGAGSAEHFSEVFRKTNASAALAAGIFHRKEVPIQSVKEHLLNEGIEVRI
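
The relationship between the CDS and Protein sequences above can be illustrated with a small translation routine. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; the function name `translate` is hypothetical. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# laid out in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons of the CDS in the record.
print(translate("ATGGAGGCGCCGCCATTTGCT"))  # → MEAPPFA
# Last five codons of the CDS, ending in the TGA stop codon.
print(translate("GAAGTCAGAATCTGA"))        # → EVRI
```

Both fragments match the start (`MEAPPFA...`) and end (`...EVRI`) of the Protein sequence listed in the record.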